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A framework for categorizing entropic measures of nonclassical correlations in bipartite 
quantum states is presented. The measures are based on the difference between a quantum 
entropic quantity and the corresponding classical quantity obtained from measurements on 
the two systems. Three types of entropic quantities are used, and three different measurement 
strategies are applied to these quantities. Many of the resulting measures of nonclassical 
correlations have been proposed previously. Properties of the various measures are explored, 
and results of evaluating the measures for two-qubit quantum states are presented. 



I. INTRODUCTION 

Maxwell demons observe a physical system and use the information obtained to extract work 
from the system [Ij. For multipartite systems, we can distinguish quantum Maxwell demons, which 
have knowledge of the entire density operator and can manipulate and make measurements on the 
joint system, from classical demons, which can only perform operations and make measurements on 
the subsystems of the multipartite system. Because a single classical demon cannot be everywhere 
at the same time, it must recruit local demons to gather, process, and use information about 
the local systems; thus it is better to think of a classical demon as a collection of local demons. 
These local demons might or might not be allowed to communicate with each other using classical 
channels. The amount of work that the two kinds of demons, quantum and classical, can extract 
from a given multipartite quantum state by employing protocols within each demon's means is a 
way of comparing quantum-information-processing protocols with classical ones. 

This demonology is but one of several attempts [H 141412] to track down and quantify the 
correlations that exist in multipartite quantum states. The nonclassical part of these correlations 
is not just quantum entanglement, even though entanglement is a part of it. The open question 
of pinning down why mixed-state quantum algorithms can solve certain problems exponentially 
faster than the best known classical ones [13], even in the absence of any significant entanglement, 
is one of the main motivations behind studying the nonclassical correlations in quantum states 
other than entanglement [T4l - [l8] . 

We consider only bipartite states in this paper. For our numerical work, the discussion is 
specialized yet further to states of two qubits. Correlations between systems can be quantified 
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in terms of correlation coefRcients and covariance matrices or in terms of entropic measures like 
mutual information. We choose the latter approach as the preferred one in information theory. The 
aim of the paper is to formulate a framework in terms of which the several entropy-based measures 
of nonclassical correlations that have been proposed can be classified and understood. Constructing 
the framework leads to two new measures we have not seen previously in the literature. The focus 
here is not so much on unifying various measures, as in Ref. [12], but rather on clarifying the 
relationships among them. 

The setting for our framework is two systems, A and with a joint quantum state pab- We 
consider three types of nonclassical-correlation measures, Ai{pAB)i between A and B: 

1. Mutual-information-based measures. 

2. Conditional-entropy-based measures. 

3. Demon-based (joint-entropy-based) measures. 

The type-2 correlation measures can be asymmetric between A and B because conditional entropy 
is typically asymmetric. 

As Landauer pointed out, when talking about demons, erasure of the demon's memory — and the 
associated thermodynamic cost — is an essential feature for assessing what a demon can do [19]. As 
we mentioned above, a classical demon that works on a bipartite quantum system is best thought 
of as two local demons working in concert. Whether the two demons can communicate impacts 
their ability to cooperate. So the demon-based measures are thus further divided into two classes: 

i. Erasure without communication between the demons. 

ii. Erasure with communication between the demons. 

All the measures of nonclassical correlations we consider here are constructed as the difference 
between a quantum entropic measure, Q{pab)j and its classical counterpart, C{pab), which is 
derived from the probabilities for results of local measurements on one or both of the subsystems. 
The thinking behind this construction is that Q quantifies some notion of all the correlations in the 
system, whereas the corresponding classical C captures only the corresponding classical correlations. 
The difference, Ai = Q — C, is therefore a way of quantifying the nonclassical correlations in the 
quantum state. 

The results of local measurements are all that local classical observers (demons) can access, 
and these measurement results are used to probe the correlations (if any) between A and B. We 
do not want, however, our measure of nonclassical correlations to depend on the specifics of the 
measurement performed. Hence, in its construction, the classical measure, C{pab), is maximized 
over all possible measurements within specific measurement strategies that are defined beforehand. 
In some instances, when maximization is necessary, we are able to show that the maximum is 
attained on rank-one POVMs; in other cases, we restrict the maximization to rank-one POVMs. 
We give a full discussion of these different situations and the issues surrounding rank-one POVMs 
after we have developed our framework. 
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We thus imagine that there are classical observers A and B — demons or otherwise — who have 
access to the two parts of the bipartite system. We allow these observers to employ one of three 
measurement strategies: 

a. Local, rank-one-projector measurements in the eigenbases of the marginal density operators. 

b. Unconditioned local measurements. 



c. Conditioned local measurements. 



For strategy (a), the local measurements are unique modulo degeneracies in the marginal density 
operators. The other two strategies require maximization of the classical measure C over the 
measurements allowed by the strategy. The first two measurement strategies do not require the 
observers to communicate with each other, but the last one does. Consequently, the first two 
strategies are symmetric between A and B. For the third strategy, A performs a measurement and 
communicates the result to B, who can then condition his measurement on the result communicated 
by A. This makes the nonclassical correlation measures that are based on the third measurement 
strategy asymmetric between A and B. 

We now have three types of correlation measures and three measurement strategies, and we can 
label the resulting correlation measures with the type and the strategy. For example, Aiib refers to 
the nonclassical correlation measure constructed as the difference between quantum and classical 
mutual informations, where unconditioned local measurements are used to construct the classical 
mutual information. 

There is a natural hierarchy in the three types of measurements strategies. Allowing arbitrary, 
unconditioned local measurements, as in strategy (b), is a restriction of the conditioned local 
measurements of strategy (c), since to get (b) from (c), observer B simply chooses to ignore any 
communication A might have sent regarding her measurement results. Likewise, measuring in the 
local eigenbases of the marginal density operators, as in strategy (a), is a restriction of the arbitrary, 
unconditioned local measurements of strategy (b). Thus, when we maximize over the measurements 
in a particular strategy, the classical measure C cannot decrease — and generally it increases — as 
we move from (a) to (b) to (c). This is saying that the more general the measurements the local 
observers are allowed to do, the more they can expect to discover about any classical correlations 
that exist between the subsystems. Since our nonclassical-correlation measure Ai is the difference 
between Q and C, Ai cannot increase — and generally it decreases — as we move from (a) to (b) 
to (c), i.e., Mja > Mjb > Mjc for j = 1, 2, 3. 

In Sec. [n] we formulate our framework: Sec. |II A| reviews the bipartite entropic information 
measures that we use in constructing our framework; Sec. II B spells out the description of local 
measurements for strategies (a)-(c); Sec. II C defines the nonclassical-correlation measures and 
discusses relations among them; and Sec. IIP considers the issues raised by assuming the local 
measurements are described by rank-one POVMs and also whether one can specialize further to 
measurements described by rank-one projectors. In Sec. Ill we present numerical results comparing 
the various measures for two-qubit states, assuming that the local measurements can be described 
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by orthogonal rank-one projection operators. A concluding Sec. |IV| draws attention to outstanding 



questions, and several appendices provide additional information. 

II. FRAMEWORK FOR ENTROPIC MEASURES OF NONCLASSICAL 

CORRELATIONS 

In this section we develop our framework for measures of nonclassical correlations and explore 
properties of the various measures the framework leads to. 

A. Entropic measures of information and correlation 

Entropic measures of information quantify how much information can be extracted from a 
system or, more poetically, how much information is "missing" about the fine-grained state of 
the system. Figure [l] is a useful pictorial representation of the relationships among the entropies 
and entropic measures of correlation that apply to bipartite systems. The figure provides an 
accurate representation for classical entropies. In the quantum case, some of the quantities cannot 
be represented or are misrepresented by this diagram, but even so, the diagram is a useful tool 
because it captures correctly the relationships among the various entropies. 

For a bipartite state pab of systems A and the quantum entropic quantities that will be 
used in the ensuing discussion are the following: 

1. S{A^ B) = S{pab) = —^^{PAB log Pab)i the joint von Neumann entropy of the whole system. 

2. S{A) = S{pa) = —t^AiPA^og pa) and S{B) = S{pb) = —t^BiPB^ogps), the von Neumann 
entropies of the marginal density operators. 

3. S{B\A) = S{A, B)-S{A) and S{A\B) = S{A, B)-S{B), the quantum conditional entropies. 

4. S{A : B) = S{A) + S{B) — S{A,B), the quantum mutual information, which is related to 
the quantum conditional entropies by S{A : B) = S{B) - S{B\A) = S{A) - S{A\B). The 
quantum mutual information can also be written as a quantum relative entropy, 

S{A:B)^S{pab\\pa(E>Pb) , (2.1) 
where the relative entropy is defined by 

Sip\\a)^-Sip)-tv{ploga) . (2.2) 

Local measurements on the bipartite quantum system are described by a joint probability 
distribution pab for outcomes labeled by a and b. Bayes's theorem relates the joint, conditional, 
and marginal distributions: Ph\aPa = Pah = Pa\bPb- These distributions are used to define the 
classical information measures: 

1. H{A, B) = H{pab) = — "^abPab logPa65 ^he Shannon entropy of the joint distribution pab- 



H{A) H{B) 



FIG. 1. (Color online) The (red) circle on the left denotes the entropy associated with system A\ the (blue) 
circle on the right denotes the entropy associated with system B. The area on the right filled in with 
(blue) dots is the information missing about B given complete information about A\ this area denotes the 
conditional entropy H{B\A). Similarly, the area on the left filled in with the (red) grid denotes H{A\B). 
The overlap between the two circles, filled with horizontal lines, denotes the mutual information H{A : 5), 
which is the information contained in A about B and vice versa. The combined envelope of the two circles 
is the joint entropy H{A, B). From the diagram, we have H{B\A) = H{A, B) - H(A) = H{B) - H{A : B) 
and H{A\B) = H{A,B) - H{B) = H(A) - H{A : B). For a classical joint probability distribution, the 
entropic measures are all Shannon entropies or relative Shannon entropies — thus they are guaranteed to be 
nonnegative — and they are related as the diagram depicts. For a bipartite quantum state, the joint quantum 
von Neumann entropy, S(A, B), and the marginal von Neumann entropies, S(A) and S(B), replace H(A, B), 
H{A), and H(B). The measures are related as depicted in the diagram, because the quantum conditional 
entropies, S(B\A) and S(A\B), and the quantum mutual information, S(A : B), are defined by these 
relations. The difference is that S{B\A) and S(A\B), as so defined, can be negative, and thus the quantum 
mutual information S(A : B) can be bigger than the marginal entropies, S(A) and S(B), and bigger than 
the joint entropy S(A,B). 

2. H{A) = H{pa) = -T^aPa ^^gPa and H{B) = H{pi,) = -Y^aPh logP6, the Shannon entropies 
of the marginal distributions, Pa and p^. 

3. H{B\A)^H{A,B)-H{A) = T.aPaH{B\a) and H{A\B)^H{A,B)-H{B) = Y.aPbH{A\h), 
the classical conditional entropies. H{B\a) = — X^^p^ia logP6|a and H{A\b) = — Yfj^Pa\h ^^ZPa\h 
are the Shannon entropies of the conditional distributions p^^j^ and p^^j^; the conditional 
entropies are averages of H{B\a) over pa and H{A\b) over p^. 

4. H[A : B) = H[A) + H{B) — H{A,B) = 'Y^i)Pab^oE>{Pab/PaPb)j the classical mutual infor- 
mation. H{A : B) is the relative information of the joint distribution pab with respect to the 
product of the marginals, PaPb^ 



H{A:B)^H{pab\\PaPb); 



(2.3) 
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the classical relative information, which is always nonnegative, is defined by 

H{Pj\\qj) = Y^PJ ^^dPj/Oij) = -H{pj) - Y^Pj logqj . (2.4) 
j j 

We also have H{A : B) = H{B) - H{B\A) = H{A) - H{A\B). 

Figure [l] summarizes the relations among the classical entropies; it works because the classical 
conditional entropies and the classical mutual information are all nonnegative. This leads to several 
inequalities that can be read off Fig. [ij For example, we can see that 

max(iJ(A), H{B)) < H{A, B) < H{A) + H{B) . (2.5) 

The lower bound on H{A^ B) is saturated when knowing one subsystem completely determines 
the other (the two circles in Fig. [l] are either identical or become nested), i.e., H[A : B) = 
min [H{A)^H{B)). The upper bound is saturated when there are no correlations between A and 
5, i.e., H{A : B) = 0, so determining one subsystem gives no information about the other (the two 
circles in Fig. [ijare disjoint). For quantum entropies the lower bound in Eq. (2.5) does not hold, 
which is equivalent to saying the quantum conditional entropies can be negative. The simplest 
counter-example is a two-qubit Bell state: the joint state is pure and, hence, has zero entropy, but 
the marginal states are completely mixed, so their entropies are maximal and both equal to one. 



B. Local measurements 

We now spell out the general description of the local measurements that applies to measurement 
strategies (a)-(c). Although we only need measurement statistics — and, hence, only need POVMs — 
to evaluate the classical entropic measures, we start our description with quantum operations, 
partly to be general and partly so we can deal with post-measurement states in a subsequent 
discussion of Maxwell demons. 

The measurement on A is described by quantum operations [20j that are labeled by the possible 
outcomes a of the measurement on A: 

A = ^^aa0 4a- (2-6) 

a 

The quantum operation is applied to a density operator by inserting the density operator in place 
of the 0. The operators Aaa, the Kraus operators of Aa^ combine to give the POVM element for 
outcome a, 

Ea^Y.'^aaAaa, (2.7) 

a 

and the POVM elements satisfy a completeness relation, I a = X^^^a- 

The absence of communication in strategies (a) and (b) makes them quite straightforward. The 
measurement on B is described by a set of quantum operations, 

Bb = Y.Bh^GBl. (2.8) 

13 



7 



These give POVM elements 

Fb = ^BlB,^, (2.9) 

which satisfy a completeness relation Ib = i^6- The state of the joint system after measurements 
with outcomes a and b is pAB\ab = ^ Bh{pAB) /Pah^ where 

Pah = tr(A ^ Bi,{pab)) = tT{Ea FtpAB) (2.10) 

is the joint probability for outcomes a and b. The post-measurement joint state and the joint 
probability marginalize to the subsystems in the standard way. 

We need to be more careful with strategy (c) because of the communication from A to B. We 
handle strategy (c) in a general way that allows us to interpolate between (b) and the extreme case 
of (c) in which every outcome a leads to a different measurement on B. We do this by introducing 
a set C whose elements c label the possible measurements to be made on B. We let A stand for 
the set of outcomes a, and we define a function c(a) that maps an outcome a to the corresponding 
value in C. We let Ac = {a\ c(a) = c} be the subset of A that leads to the B measurement labeled 
by c. The subsets Ac partition A into disjoint subsets. We can regard C as another variable 
in our analysis; it is a coarse graining of the measurement on A. Formally, we have that C is 
perfectly correlated with A, i.e., Pc|a = ^c,c(a)? implying that H{C\A) = and H{A : C) = H{C). 
Should there be only one possible measurement on B^ i.e., only one value of c, then there is no 
communication, and the situation reduces to strategy (b). The extreme case of (c) corresponds to 
having a different value of c for each outcome a, in which case there is no difference between the 
outcome set A and the set C. 

The state of the joint system after the measurement on A yields outcome a is PAB\a — 
Aa{pAB)/Pa, where 

Pa = tr{Aa{pAB)) = ^^A{EaPA) (2.11) 

is the probability for outcome a. The state of system 5, conditioned on outcome a, is 

PB\a = ^^A{pAB\a) = ^ ' I (2.12) 

Pa 

notice that this is determined by the POVM element Ea- The probability for making measurement c 
on B follows formally from 

Pc = "^PclaPa = '^Pa = tTA{EcPA) • (2.13) 
a aGAc 

Here we introduce coarse-grained POVM elements for the measurement on A, labeled by the 
measurement to be made on B: 



Ec — ^ Ea 

aeAc 



(2.14) 
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Notice that if there is only one possible measurement on i.e., only one value of c, then Ec = Ia] 
when there is a different measurement for each outcome a, the POVM elements Ec are the same as 
the POVM elements Ea. We also have the state of B conditioned on the coarse-grained outcome c: 

Pb\c = ^ • (2.15) 

Pc 



Notice that Eqs. ( |2.12p and ( |2.15p imply that 

PB = "^PapBla = "^PcpBlc • (2-16) 



c 



We turn our attention now to the measurements on B. We let B stand for the set of all outcomes 
on B for all the possible measurements on B. We define a function c{b) that maps an outcome 
b to the measurement c in which it occurs, and we define Be = {b \ c{b) = c} to be the subset 
of B outcomes for the measurement labeled by c. The subsets Be partition the set of all possible 
outcomes on B into disjoint subsets. We again have perfect correlation, i.e., Pc|6 = ^c,c{b)^ implying 
that H{C\B) = and H{B : C) = H{C). 

The measurement on B that is labeled by c is described by quantum operations 

The Kraus operators give the POVM elements for this measurement, 

Fh\c = '^^l(3\c^bp\c , (2.18) 

and these satisfy a completeness relation Ib = X^^^Bc ^b\c- 1^ sums over 6, we can let the sum run 
over the outcomes of all the possible measurements on B by the artifice of defining B^/^ic = for 
b ^ Be and, hence, F^j^ = for b ^ Be- 

The state of the joint system, conditioned on outcomes a and 6, is 

A f3^c{a) (PAb) 13^c{a) {PAB\a) , 

PABlab = = 7 (2.19) 

Pab Pb\a 

where 

Pab = tr(A ^ Bb\c{a){pAB)) = tT{Ea ^ F^e{a)PAB) = PatTB{Fne{a)PB\a) (2-20) 

is the joint probability for a and b and 

Pb\a = ^^{^b\c{a){pAB\a)) = ^^{F^\c{a) PB\a) (2.21) 

is the conditional probability for b given a. Notice that Pab and p^j^ are nonzero only if 6 G ^c(a) 
or, equivalently, only if a G ^c(6)- 

For our purposes, it is easier to work with the coarse-grained outcomes c, which specify the 
measurements on B. Indeed, the joint probability for b and c is 

Pbc = ^Pc\abPab = ^Pab = ^liEc ^ F^^^^PAb) = Pc^T B{F^ePB\c) • (2.22) 
a aeAc 
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Notice that is nonzero only if 6 G ^c- Thus the conditional probability of b given c takes the 
form 

Pb\c = ^ = tiB{F^cPB\c) , (2.23) 

Pc 

and the unconditioned probability for b is 

Pb = "^Pbc = tr(^c(6) ^6|c(6)PAb) = Pc(6)trs(i^6|c(6)PS|c(6)) • (2-24) 
c 



C. Measures of nonclassical correlations 



In this subsection we formulate our framework for entropic measures of nonclassical correlations, 
considering in turn the three types of measures introduced in Sec. |T] and for each type, the three 
local measurement strategies, (a), (b), and (c). For strategy (a), the local measurements are in 
the eigenbases of the marginal density operators. For strategies (h) and (c), we assume that the 
measurements are described by rank- one POVMs, which means that Ea and F^^ are multiples of 



rank-one projection operators. We discuss this assumption in Sec. |IID 

To compare and relate the various measures, we rely on two inequalities that relate the quantum 
and the classical entropies: the POVM inequality (see Appendix [A| for a proof) and the ensemble 
inequality [20] . 

The POVM inequality relates the quantum entropy for a state p to the classical entropy for 
probabilities pj = ti{Ejp) obtained from (nonzero) POVM elements Ej: 

H{Pj) + Y.Pi log(tri;,) ^-Y^Pj log(^) > S{p) . (2.25) 

3 3 \ 3/ 

A rank-one POVM is one such that all the POVM elements are rank-one, i.e., Ej = pjPj, where Pj 
is a rank-one projection operator and < pj = tiEj < 1. The trace of the completeness relation 
implies that pj = (dimension of the quantum system). For a rank-one POVM, we have 

H{pj) > S{p) - J2 PJ log Mi > Sip) . (2.26) 

3 

The ensemble inequality [20] says that the Shannon information of a set of ensemble probabilities 
Qj exceeds the Holevo quantity of the ensemble: 

H{q,) > s(J2QjP?) -YljSipj) . (2.27) 

^3 ' 3 

For strategy (a), where the local measurements are in the eigenbases of the marginal density 
operators, we have immediately that i^(A) = S{^A) and H{B) = S{B). For both (b) and (c), we can 
apply the POVM inequality in its rank-one form to pa = trl^EaPA) to conclude that H{A) > S{A). 
Similarly, for strategy (b), the POVM inequality applied to = tr(F5p^) gives H{B) > S{B). 
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For strategy (c), we need a chain of inequalities to conclude that H{B) > S{B): 

H(B) = H{C, B) = H{C) + H{B\C) 

^H{p,) + Y,PcH{B\c) 

C 

> H{p,) + Y,PcS{PB\c) (2.28) 

C 

> s{^PcPB\}j (2.29) 
= S{pb) = S{B) . (2.30) 



The first inequality (2.28) is a consequence of applying the POVM inequality to Eq. (2.23), the 



second inequality (2.29) is an example of the ensemble inequality, and the final equality uses 



Eq. (2.16). 



1. Type 1: Mutual-information-based measures 

For type-1 measures, we choose Qi = S{A : B) and Ci = H{A : B), giving the difference 
measure 

Ml = S{A : B) - H{A : B) . (2.31) 

We now apply the three measurement strategies introduced in Sec.|T|to obtain the classical mutual 
information H{A : B); this leads to three different type-1 measures. 

For strategy (a), the local measurements are made in the eigenbases of the marginal density 
operators, and this gives a nonclassical-correlation measure that we denote by A^i^. If the marginal 
density operators have nondegenerate eigenvalues, the marginal eigenbases are unique; in the case 
of degeneracy, one needs to maximize H{A : B) over the rank-one, project ion- valued measurements 
in the degenerate subspaces to get a unique measure Aiia- The measure Aiia was introduced by 
Luo in [lOj and called there the measurement-induced disturbance (MID). The same measure, in a 
different guise, had been proposed by Rajagopal and Randall in [8]; they defined what they called 
the quantum deficit as H{A, B) — S{A^ B), where H{A, B) is obtained from measurements in the 
marginal eigenbases. The quantum deficit and MID are the same because they differ by the terms 
H{A) — S{A) and H{B) — S{B), which are zero for measurements in the marginal eigenbases. 

When strategy (b) is used, we obtain the measure 

Mib = S(A : B) - maxiJ(A : B) , (2.32) 

(b) 

where the classical mutual information has to be maximized over the unconditioned local mea- 
surements of strategy (b). The maximum classical mutual information was introduced in [9J as a 
measure of classical correlations, and the same paper suggested M.ib as a measure of nonclassical 
correlations. This measure was investigated in detail by Wu, Poulsen, and M0lmer (WPM) in [llj, 
and we refer to it as the WPM measure in this paper, while denoting it as M-ib- The optimal un- 
conditioned local measurements are not necessarily orthogonal-projection- valued. An example of a 
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case in which the maximization requires POVMs and not just projective measurements was given 
in [n]; we review and extend this example in Appendix [Cj In addition, the optimal local measure- 
ments do not generally occur in the marginal eigenbases, which implies that A^i^^^^ > Ai^^^^\ 
For strategy (c), the classical mutual information H{A : B) can be made arbitrarily large, thus 
allowing Aiic to be arbitrarily negative. This is easy to see by considering the extreme case of (c) 
in which every outcome a leads to a different measurement on system then, as noted in Sec. |IIB[ 
H{A : B) = H{A), which can be as big as desired by giving the measurement on A an arbitrarily 
large number of outcomes. We conclude that Aiic has nothing to do with quantifying nonclassical 
correlations, so we drop Aiic from our array of possible measures. 

2. Type 2: Conditional- entropy-based measures 

For type-2 measures, we choose Q2 = —S{B\A) and C2 = —H{B\A). The result is the difference 
measure 



We notice immediately that 



M2 = H{B\A) - S{B\A) . (2.33) 



M2=Mi+ [H{B) - S{B)] > Ml . (2.34) 



This shows that a type-1 measure is always less than or equal to the type-2 measure that uses the 
same measurement strategy, with equality only when B is measured in the marginal eigenbasis. 

Measurements in the eigenbases of the marginal density operators have H{B) = S{B), so for 
strategy (a), we have M.2a = Aiia^ and our measure is again MID. 

Strategy (b) gives the measure 

M2b = mmH{B\A) - S{B\A) , (2.35) 
(b) 

where we have to minimize H{B\A) over all unconditioned local measurements. We can conclude 
from general considerations that A^i^^^^ = -^20^^^^ ^ -^26 ^ Ai^^^^\ Notice also that the 
unconditioned local measurements that minimize H{B\A) need not be the same as those that 
minimize H{A\B). This means that A^26 is intrinsically asymmetric between subsystems A and B 
even though the measurement strategy is symmetric. 
Strategy (c) gives the measure 

M2c = mmH{B\A) - S{B\A) . (2.36) 

(c) 

The POVM inequality immediately gives a bound on H{B\A), 

H{B\A) = ^p„/7(i?|a) > Y.paS{B\a) = H{e^^{B\A) . (2.37) 

a a 

When we are allowed to make conditional measurements on 5, the bound can be achieved by 
measuring 5, for outcome a, in the eigenbasis of PB\a' Hence, with the conditional measurements 
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on B specified, the minimization of the classical conditional entropy, H{B\A), is reduced to choosing 
a measurement on A that minimizes the conditional entropy H^E^j{B\A): 

mm H{B\A) = rmnH{E^yiB\A) ^ H{B\A) . (2.38) 

The quantity H{B\A) is a special sort of classical conditional entropy. The resulting measure is 
the quantum discord [21 [6] : 

M2c = H{B\A) - S{B\A) = V{A B) . (2.39) 

Here we also introduce a notation for discord that emphasizes explicitly its asymmetry between A 
and B. In Appendix [c{ we exhibit joint states that show that to find the minimum H{B\A) — and, 
hence, to find the quantum discord — sometimes requires rank-one POVMs, not just orthogonal- 
project ion- valued measurements. 

Henderson and Vedral [7J introduced the quantity J{A B) = S{B) — H{B\A) as a mea- 
sure of classical correlations. Ollivier and Zurek [6J considered J{A B) to be an asymmet- 
ric, measurement-based version of the mutual information and thus defined quantum discord as 
V{A B) = S{A : B) - J{A B) = -S{B\A) + H{B\A) = M2c- In particular, Ollivier and 
Zurek did not define discord in terms of conditioned measurements on 5, but rather assumed that 
the quantity to be minimized over measurements on A is the conditional entropy H^E^y{B\A). 

We can conclude from general considerations that A^i^^^^ = •^20^^^'' ^ ^ M.^^'^'^^^K Our 
present considerations do not, however, provide an ordering of the WPM measure and quantum 
discord. We return to the ordering of WPM and discord in Sec. IIC4 and show in Appendix [B] 



3. Type 3: Demon-based measures 



Type-3 measures quantify the difference in the work that can be extracted from a quantum 
system by quantum and classical demons. The demons extract work by transforming the initial joint 
state pAB to the fully mixed joint state using any means at their disposal, including measurements. 
We assume here that all states of the system have the same energy so that all the work that the 
demons extract arises from the entropy difference between the initial and final states of the system; 
it is natural to choose ksT lii2 as the unit of work. Throughout the paper, whenever we talk about 
extract able work and erasure cost, we actually mean average work and average erasure cost. 

The maximum work that can be extracted by a quantum demon by any means is given by the 
entropy difference between the initial and final states, 

W,^logidAdB)-SiA,B) , (2.40) 

where dA and ds are the dimensions of the two subsystems. The demon could extract this amount 
of work by devising an optimal process that directly transforms the joint state pab to the maximally 
mixed state. It could, instead, make a measurement in the joint eigenbasis of pab, extract work 
log{dAdB) as the post-measurement pure eigenstate is transformed to the maximally mixed state. 
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and then pay a price S{A, B) to erase its memory of the S{A^ B) bits acquired in the measurement. 
The demon would then be ready to pick up another copy of the system and repeat the process. 

In contrast to a quantum demon, a local, classical demon can only manipulate the subsystem 
in its possession. In Sec. |I|we introduced two cases for the local demons that are dealing with our 
bipartite system. In case (i) the two demons are not allowed to communicate with each other. In 
this case, the maximum amount of work demon A can extract from subsystem A is logd^ — S{A). 
This can be achieved by an optimal process that directly transforms the marginal state pA to 
the maximally mixed state or by measuring in the marginal eigenbasis, extracting work logd^ as 
the post-measurement pure state is transformed to the maximally mixed state, and then erasing 
the S{A) bits of measurement record at cost S{A). Since demon B is in the same situation, the 
maximum work the two local demons can extract is 

Wc = logidAde) - SiA) - SiB) . (2.41) 

The difference in tlie amount of work that can be extracted by the quantum and classical demons, 
called the work deficit [3-5J, is the quantum mutual information: 

Wg-Wc = SiA) + SiB) - S{A, B) = SiA : B) = Msii) ■ (2.42) 

Brodutch and Terno [5J have noted that the work deficit in the case of erasure without commu- 
nication between the local demons provides an operational interpretation of the quantum mutual 
information. 

In case (ii) the local demons can communicate their measurement results and thus reduce their 
cost of erasure. In particular, the demons make local measurements, which in accord with the 
assumptions of this section are described by rank-one POVMs and thus leave the two subsystems 
in pure states. They can then extract work 

= logdA + logdB (2.43) 

as their respective systems are transformed to the maximally mixed state. They must then erase 
their memories of the measurement record so they are ready to handle another copy of the joint 
state pab- 

In the absence of communication, the total erasure cost is W~ = H{A) +H[B) > S{A) + S{B), 
with the minimum attained for measurements in the marginal eigenbases; the net work the demons 
can extract is that of case (i), i.e., Wc = W+ — W~ = log dAdi) — S{A) — S{B). If the demons 
can communicate, however, as in case (ii), then they can take advantage of correlations between 
their measurement results to reduce their erasure cost to the joint classical information in their 
measurement records, W~ = H{A,B), which gives net work 

Wc = W^ - W- = logidAds) - H{A, B) . (2.44) 

Thus in case (ii), the work deficit becomes 



Wq-Wc^ H{A, B) - S{A, B)^Ms, 



(2.45) 
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giving us joint-entropy-based measures of nonclassical correlations, with Q3 = —S{A,B), C3 = 
-iJ(A,5),and M3 = Q-C. 

We now have to consider the three measurement strategies for the local demons, but before 
embarking on that, we note that 

X3 = X2 + [H{A) - S{A)] =Mi+ [H{B) - S{B)\ + [H{A) - S{A)\ , (2.46) 

so for each measurement strategy, we have A^3 > M2 ^A^i, as we have noted earlier. 

For strategy (a), measurement in the marginal eigenbases, we have H[A) = S{A) and H{B) = 
S{B), so we again get the MID measure, i.e., Aisa = = ^la; this is the form in which 

Rajagopal and Randall [8] defined what they called the quantum deficit. For strategy (b), we have 
to minimize H{A^ B) over all unconditioned local measurements, 

M^b = min H{A, B) - S[A, B) ; (2.47) 
(b) 

in general, the result is not the same as A^26 oi" M^ih- 

For strategy (c), we have to minimize H{A,B) over all conditioned local measurements. The 



minimization over the conditioned measurements on B is simple, since as in Eq. (2.37), we have 



H{A, B) = H{A) + H{B\A) > H{A) + ^p„5(i?|a) = H{e^^{A, B) , (2.48) 

a 

with equality if and only if the measurement on B, given outcome a, is in the marginal eigenbasis 
of PB\a- Hence, with the conditional measurements on B specified, the minimization of the classical 
joint entropy, H{A, B), is reduced to choosing a measurement on A that minimizes the joint entropy 
H{EjA,By. 

min HiA,B) = mm H^e^ {A, B) = H (A, B) . (2.49) 

(c) {Ea} 

The quantity H{A, B) is a special sort of classical joint entropy. The resulting measure of nonclas- 
sical correlations is 

M^c = H{A,B)-S{A,B) . (2.50) 

This measure was hinted at in Zurek's original paper on discord [2J. Ollivier and Zurek ^ defined 
quantum discord as the quantity A^2c5 but Zurek [4] resurrected M.2>c as a modified form of discord 
in his paper on discord and Maxwell demons. Brodutch and Terno [5] have also pointed out that 
M^c is the measure that applies to demons that can communicate and use strategy (c) for their 
measurements. Hence, we can call Alsc the demon discord (dd). 
As noted in Sec. |l| we have Mf^^ > Msb > M^^J^l 
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4- Properties of nonclassical- correlation measures 

The following array neatly summarizes the measures of nonclassical correlations that we have 
found and the relations we have found among them: 

_^(MID) ^ _^(WPM) 

II Al 

A^^r^^ > M2b > Xg") (2.51) 
II Al Al 

SiA,B) = A^3(i) > A^ir^ > Ms, > Mif 

The vertically oriented inequalities are best read by leaning your head to the left; in the absence 
of leaning, the wedges point toward the smaller quantity, as is standard. 

Of the potential measures we started with, the demon-based measure that assumes erasure 
without communication is special and gives the quantum mutual information. Of the remaining 
nine potential measures, we discarded one, Aiic, as meaningless; we found that the three measures 
in the left column of the array are all identical to the MID measure; we determined that three of 
the other measures are the WPM measure, quantum discord, and demon discord; and we are thus 
left with two new measures, A42b and although Aisb is very closely related to — and perhaps 

identical to — a discord-like measure introduced by Modi et al. [T2] . 

Modi et al. [12j introduced a set of measures of quantum and classical correlations based on 
the relative-entropy distance (2.2) between a multi-partite state p and the nearest state ap that is 



diagonal in a product basis, or between p and the nearest product state. The only one of these 
measures relevant to our discussion is their "discord," which when specialized to bipartite states, 
is the distance X^Modi = ^^^aAB '^{pabIWab)^ where aAB is diagonal in a product basis. Modi et 
al. show that the minimum is attained on a state obtained by projecting pab into a product basis, 
i.e., aAB = J2a,b\^ciJb){eaJb\pAB\eaJb){eaJb\, in which case, S{pAB\\crAB) = S{(jab) - S{pab)- 
Thus we have 

^Modi = min S{(jab) - S{pab) • (2.52) 

Since S{aAB) is the classical joint entropy of a measurement made on pab in the product basis 
1^05/5)5 this would be the same as our if we knew that the optimal local measurements for 
Aisb were described by orthogonal rank-one projectors. 

Brodutch and Terno [5j define three kinds of "discord": their Di is the standard discord 
^(discord). ^YieiT D2 is the demon discord Alg^*^^; and their Ds is a discord-like quantity that 
uses a different conditional measurement strategy. This strategy allows conditioned local measure- 
ments, but with the measurement on A constrained to be in the marginal eigenbasis of pA- The 
Brodutch- Terno measurement strategy is a restriction of strategy (c), and (a) is a restriction of 



the Brodutch- Terno strategy. Measures based on it could thus be placed in the array (2.51) as an 
alternative intermediate column whose ordering with strategy (b) is indeterminate. 



All the measures in the array (2.51), except the quantum mutual information, are bounded 



above by MID, and MID is bounded above by the quantum mutual information. Similarly, MID. 
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A^265 aiid Aisb are bounded below by both the WPM measure and the quantum discord, and the 
demon discord Aisc is bounded below by discord. The WPM measure and quantum discord have 
a special status in that they are the most parsimonious of the measures in quantifying nonclassical 
correlations. 

WPM showed that their measure is nonnegative, and Datta ^16j showed that discord is nonneg- 
ative, allowing us to conclude that all the other measures are also nonnegative. Both proofs rely 
on the strong subadditivity of quantum entropy [20]; we review the proofs in Appendix [Bj Careful 
consideration of the conditions for saturating the strong-additivity inequality [21], not presented 
here, give the conditions for WPM and discord to be zero: the WPM measure is zero if and only 
if Pab is diagonal in a product basis, i.e., an orthonormal basis of the form \ea) \fb), and discord 
is zero if and only if pab is diagonal in a conditional product basis (pointing from A to B), i.e., an 
orthonormal basis of the form \ea) \fb\a)- 

Since MID, like WPM, is zero if and only if pab is diagonal in a product basis, the relations 



in the array (2.51) imply that A42b and Aisb are zero if and only if pab is diagonal in a product 



basis. Similarly, the inequality M^^f^ > Ad^^^^^^^"^ shows that having pab diagonal in a conditional 



product basis is necessary to make Adsc zero, and a moment's contemplation of Eqs. ( 2.48 )-( 2.50) 
shows that this is also a sufficient condition. 

For pure states, we have S{A,B) = 0, S{A) = S{B) = -S{B\A) = -S{A\B), and S{A : B) = 
2S{A) = 2S{B). It is easy to show that the optimal measurement for all the measures in the array is 
measurement in the Schmidt basis of the pure state (marginal eigenbasis for each subsystem) , which 
gives H{A) = H{B) = H{A,B) = H{A : B) = S{A) = S{B) and H{B\A) = H{A\B) = 0. Thus 
all the measures in the array, except the quantum mutual information, are equal to the marginal 
quantum entropy, S{A) = S{B), which is the entropic measure of entanglement for bipartite pure 
states. 

The remaining gap in our understanding left by the relations in the array is whether there is 
an inequality between the WPM measure and discord. The WPM measure is strictly bigger than 
zero for states that are diagonal in a conditional product basis that is not a product basis and so 
is bigger than the quantum discord for such states. If there is an inequality, it must be that the 
WPM measure is bounded below by quantum discord. Indeed, it is not hard to come up with a 
proof, using the method of Piani et al [9J . The proof, given in Appendix [Bj is part of the two-step 
demonstration that WPM and discord are nonnegative. We conclude that 

m'^^™^ > X^f ^"■■'^^ = V{A ^ B) . (2.53) 

The proof allows us to identify the equality condition: the WPM measure is equal to discord if 
and only if pab is diagonal in a conditional product basis that points from B to A. 

We emphasize that product bases and conditional product bases do not exhaust the set of 
orthonormal bases that are made up of product states. There are orthonormal bases made up 
entirely of product states that are neither product bases nor conditional product bases; these have 
been studied, for example, in the context of nonlocality without entanglement Not surprisingly, 
we refer to such a basis as a basis of product states, to be distinguished from a product basis or a 
conditional product basis. 
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D. Rank-one POVMs and projective measurements 

In Sec. |II C| we assumed that all the measurements were described by rank-one POVMs. This 



assumption does not affect the demon-based work deficit (2.42) for the case of erasure without 
communication, for that case, which leads to the quantum mutual information, does not rely 
on any assumptions about how the subsystems are measured. Nor does this assumption affect 
MID, which is derived from measurement strategy (a), a strategy that from the outset prescribes 
orthogonal-projection- valued measurements in the eigenbases of the marginal density operators. 
The assumption must be carefully examined, however, for measurement strategies (b) and (c). On 
the face of it, there is a problem for the second and third rows of our array. For type-2 measures, the 
task is to minimize a classical conditional entropy, and for type-3 measures, the task is to minimize 
a classical joint entropy. In both cases, the minimum is achieved by making no measurements 
at all. 

For the demon-based measures in the right two columns of the third row, it is clear what the 
problem is. The contribution of H{A, B) to the classical work comes from the erasure cost; the 
local demons can minimize their erasure cost by not having a measurement record. Of course, if 
the local demons make no measurements, they also cannot extract the work attendant on knowing 



more about their system's state. The upshot is that formula (2.44) for the net classical work needs 
to be modified if one does not assume measurements described by rank-one POVMs. Appendix [P] 
shows, not surprisingly, that, once modified, the net classical work is always optimized on rank-one 
POVMs, so one can restrict the demons in this way without affecting their performance. 

For the measure A^26? we know of no reason to restrict to rank-one POVMs more compelling than 
declaring that the measure would be nonsense without this restriction. For the quantum discord, we 
can do better: the original definition of discord [6j did not discuss conditioned measurements on 
but rather formulated the discord directly in terms of minimizing the classical conditional entropy 



as in Eqs. (2.37)-(2.39); this is equivalent to our assuming rank-one POVMs for the measurement on 
B. We are still left with a question — why should the measurements on A be restricted to rank-one 
POVMs? — and this same question applies to both local measurements for the WPM measure. We 
now address this question by showing in both situations that the optimum can always be attained 
on rank-one POVMs. It is important to show this, because the proofs regarding nonnegativity and 
ordering of the WPM measure and discord, given in Appendix [Bj assume rank-one POVMs. 

We deal with the WPM measure first. The key point is obvious: making coarse-grained POVM 
measurements on A and B should not uncover as much mutual information as making fine-grained, 
rank-one POVM measurements. We start with POVMs {Ea} and {F5} for systems A and B, and 
we imagine that these are a coarse graining of POVMs {Eaj} and {F^/c}? i-^., 

Ea = Yl ^-J ' Fi, = Y,Fbk- (2.54) 



A POVM element can always be fine-grained to the rank-one level by writing it in terms of its 
eigendecomposition. The joint probability for the fine-grained outcomes aj and bk is Pajbk = 
Pjk\abPab^ with similar relations for the marginals for the two subsystems. It is now trivial to show 
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that fine graining never decreases the classical mutual information: 

H{A, J ■.B,K)^ H(A : B) + ^pabH{J : K\a, b) . (2.55) 

a,b 

This means that in maximizing the classical mutual information, we need only consider rank-one 
POVMs. 

For the discord, the reduction to rank-one POVMs has been demonstrated by Datta [16]; it 
is sufficiently brief that we repeat it here. Since the conditional measurements on B are already 
specified, we need only worry about fine graining the measurement on A. We need the conditional 
state of B given the coarse-grained outcome a in terms of the conditional states given the fine- 
grained outcome aj: 

= = = Z^Pj\aPB\aj , (2.56) 



' Pa ^ Pa 



where 



Pa tlA[EapA) 



The quantity to be minimized over measurements on A is the conditional entropy (2.37). For it. 
we can write 



^Y.P»^{Y.Pi\<^PB\aj) (2.58) 

a J 

> J2P-^S(PB\aj) = H{E^MB\^) ' (2.59) 

where the inequality follows from the concavity of the von Neumann entropy. Thus fine graining 
never increases this conditional entropy, so we are assured that the minimum is attained on rank-one 
POVMs. 

We have now settled the question of restricting to rank-one POVMs for all the measures except 
the measure in the middle, and for it, we simply assert that it makes sense only if we restrict 

to rank-one POVMs. A remaining question is whether we can further restrict to orthogonal- 
project ion- valued measurements. Searching over the entire set of rank-one POVMs is a daunting 
task, considerably more onerous than searching just over projection- valued measurements. On this 
score, we can report that WPM drew attention to an example where the WPM measurements are 
optimized on a rank-one POVM that is not project ion- valued; we extend this example to quantum 
discord and generalize it in Appendix [C] 

These examples, however, require that at least one system have dimension bigger than two; there 
is currently no evidence that two-qubit systems require anything other than orthogonal projectors. 
In the next section, we do a wholesale evaluation of the various measures for two-qubit states; in 
keeping with the lack of evidence and the need for manageable numerics, we restrict the search 
over measurements to rank-one, orthogonal projection operators. 
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Several groups of investigators have considered Gaussian versions of nonclassical-correlation 
measures for Gaussian states of two harmonic-oscillator modes; the local measurements are re- 
stricted to Gaussian measurements, i.e., measurements whose POVM elements are the phase-space 
displacements of a particular single-mode Gaussian state. Giorda and Paris [23] and Addesso and 
Datta ^24j focused on a Gaussian version of discord and showed that the optimal Gaussian measure- 
ments are rank-one POVMs, but that for some Gaussian states, the optimal measurement is not 
orthogonal-projection- valued and thus requires the use of POVMs. Mista et al [25j investigated 
Gaussian versions of MID and WPM (which they called AmeriolatedMID); their investigation 
showed that for some Gaussian states, the optimal Gaussian measurement for the WPM measure 
is not the globally optimal measurement when one allows nonGaussian POVMs. 

III. NUMERICAL RESULTS FOR TWO-QUBIT STATES 
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B) plotted against M.^^^^^ for one million randomly generated joint density 



matrices, using orthogonal projectors for the measurements. As expected, the WPM measure is never smaller 
than the discord; also evident is that discord is zero for a larger class of states than the WPM measure, 
those being the states that are diagonal in a conditional product basis pointing from Aio B. 



One purpose of our framework is to clarify relations among the various measures of nonclassical 
correlations beyond entanglement. The ordering of the measures is of particular interest. The 
framework provides by construction some ordering relations between the measures; in addition, we 
have proved, using the method of Piani et al, the important relation that the WPM measure is 
bounded below by the discord. Nonetheless, questions remain, in particular, of whether there is 
an ordering between Ai2b and as well as between A^^^^^^ and A^3c*^^- 

In this section we illustrate and investigate the various orderings by presenting numerical evalu- 
ations of the several measures for randomly selected two-qubit states. It should be noted, however, 
that in order to do the optimizations over measurements numerically, we have had to restrict our- 
selves to orthogonal projectors instead of the more general POVMs, so in some situations, we might 
not be finding the optimal measurements. 
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FIG. 3. plotted against M-H^ for 100,000 randomly generated joint density matrices. Since 

A^3c^^ > A^2c^^^°^^\ points from Fig.[2]move upwards. Many points pass the diagonal, and the ordering 
of Fig. [2] disappears. 
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FIG. 4. A^3c plotted against M2b for 100,000 randomly generated joint density matrices. Relative to 
Fig. [3) the points move right, due to the relation M2b > Mib- Since not all of them pass the diagonal, there 
is no ordering relation between A^sc and M2b- 



To calculate the various correlation measures, we follow the approach of Al-Qasimi and 
James [26]. The measurement operators Ea and F5 are orthogonal projectors, 

Ea^\e^){e^\, Ffe-|ef)(ef|, a,6e{0,l}, (3.1) 
|e^) = cos0^|O) + e*'^''sin0^|l) , |ef ) = - sin0^|O) + e^'^'' cos0^|l) . (3.2) 

The required optimization is done by a numerical search over the angles {6>^, 0^} for X G {A, B}. 
For measurement strategy (b), we must search over the four angles for both qubits, but for strat- 
egy (c), we need only search over the two angles for subsystem A. 

Figure [2] compares the WPM measure and discord, confirming the expectation that the WPM 
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measure is never smaller than discord. Figures [3] and |4] display the aforementioned pairs of correla- 
tion measures where our framework does not imply an ordering relation; the numerical data show 
that there is no ordering for these pairs. 
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FIG. 5. (A) Discord (blue circles) and the WPM measure (yellow crosses) for one million randomly chosen 
two-qubit states, plotted against entanglement of formation, Ef. As the correlations increase, the spread 
between entanglement and WPM or discord decreases. (B) Two superimposed histograms showing the 
distribution of discord and the WPM measure for ranges of values of Ef. left histogram shows discord (red) 
and WPM (yellow) for the states of (A) corresponding to 0.1 < Ef < 0.2; right histogram shows discord 
(blue) and WPM (green) corresponding to 0.3 < Ef < 0.4. 



Another relation we have explored numerically is the one between the correlation measures and 
entanglement. Figure [5] shows discord and the WPM measure plotted against entanglement of 
formation, reproducing the plot in [26j for discord, but providing new data for the WPM measure. 
The entanglement of formation is calculated using Wootters's analytical expression [27J, Ef{p) = 
/i((l + Y^l — C^(p))/2) . Here h{x) is the binary entropy, h{x) = —xlogx — (1 — x)log(l — x), 
and C{p) is the concurrence, given by C{p) = max(0, Ai — A2 — A3 — A4), where the AjS are the 
eigenvalues in decreasing order of the operator ^/^/pp^/p, with p = {ay (8) (jy)p''{(jy ® (ly). 

To avoid the slow numerical optimization procedures, analytical expressions for the correlation 
measures would be desirable. Yet only for very restricted classes of joint states are such expressions 
available. Girolami et al. ^28j suggested that there is an analytical expression for the WPM measure 
for general two-qubit states. To understand their claim, we write the joint two-qubit state in terms 
of Pauli operators: 

1 ^ 
PAB = J (/as + a . o-^ ^ /b + /a ^ b • o-^ + ^ Cjkcrf ^ ) . (3.3) 

The correlation matrix, Cjk = tr (crj^ (g) cr^ pab)-, is not symmetric, but c can be diagonalized as 
(p-crrij = y^jTUj. The eigenvalues, A|, are the squares of the singular values of c, and the eigenvectors 
are the right singular vectors of c. The correlation matrix maps the right singular vectors to the 
left singular vectors, crrij = Ajrij, and the left singular vectors, rij, are the eigenvectors of cc?- . 
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FIG. 6. Deviation of the numerically obtained, optimal measurement vectors from the maximal singular 
vectors of the correlation matrix for the WPM measure. The joint states are a mixture of a pure product state 
with marginal spin (Bloch) vectors a = (1, 0, 0) and b = (l/v^, —1/2, 1/2) and a mixed Bell-diagonal (zero 
marginal spin vectors) state, with correlation matrix c = diag(— 0.9, —0.8, —0.7). The mixing parameter is 
e, with e = corresponding to the product state and e = 1 to the Bell-diagonal state. The green curve shows 
the cosine of the angle between the maximal right singular vector and the measurement vector on system B. 
The red curve is the cosine of the angle between maximal left singular vector and the measurement vector 
on system A. 



The claim of Girolami et al. was that the maximal left and right singular vectors, i.e., those 
corresponding to the largest singular value of c, specify the optimal measurements for the WPM 
measure. We can confirm that the measurement vectors for generic (randomly generated) two- 
qubit states are close to the maximal singular vectors of c for all three correlation measures that 
are based on measurement strategy (b), but it can be shown analytically that in general the 
singular vectors are not the optimal measurement vectors. Moreover, there are examples where the 
deviation becomes obvious in the numerics. Figure [6] shows an example where the angle between 
the measurement vectors and the maximal singular vectors is noticeable in the calculation of the 
WPM measure. Similar plots can be obtained for the measures A42b and Adsb- 



IV. CONCLUSION 



We consider in this paper several entropic measures of nonclassical correlations. All of these 
measures purport to quantify the degree of nonclassicality in bipartite quantum states. Just as 
important as the degree of nonclassicality, however, is the boundary these measures set between 
quantum and classical states. For the measures M.i^^^\ M.^^^^\ M.2b^ and M-sb^ a joint state 
is classical if and only if it is diagonal in a product basis; this notion of classicality is clearly a 
consequence of using measurement strategies (a) and (b). The measures ^(^^^^°^^) ^nd M.^^'^ both 
consider a joint state to be classical if it is diagonal in a conditional product basis pointing from A 
to B; this boundary between quantum and classical is a consequence of measurement strategy (c). 
The measures differ in the "amount" of nonclassicality they assign to states they deem not classical. 

The boundary between quantum and classical is important by itself, first, because it can usually 



23 



be extended to multipartite systems even when the measure of nonclassical correlations is not so 
easily extended and, second, because it serves as the basis for interesting questions about quantum- 
information processing. For example, Eastin [18j has recently investigated whether concordant 
computations can be simulated classically. A concordant computation is one such that after every 
elementary gate, the state of the whole computer is diagonal in a product basis. The entire 
computation is just a matter of changing the product basis, yet Eastin finds it difficult in general 
to simulate such computations efficiently. 

This discussion raises at least two other questions, the first of which concerns the boundary 
between quantum and classical. The boundaries induced by the nonclassical-correlation measures 
considered here are natural — naive is perhaps a better word — in that the classicality of a state 
is defined in terms of properties of its eigenvectors. This is quite different from the distinction 
between separable and entangled states [29], which pays no attention to the properties of the joint 
state's eigenvectors; a state is separable if and only if it has an ensemble decomposition — not an 
eigendecomposition — in terms of product states. The boundaries for the measures discussed in this 
paper do require a classical state to have unentangled eigenvectors, but they are more restrictive 
than saying that a state is classical if its eigenvectors are product states. There are orthonormal 
bases of product states that are neither product bases nor conditional product bases ^22j. The 
boundaries considered here, imposed by the measurement strategies, clearly have to do with wanting 
the product states in a joint eigenbasis to persist into the marginal eigenbasis of one or both 
subsystems. A natural question is wheher there is some other way of setting the quantum-classical 
boundary so that a joint state is classical if and only if it has unentangled eigenvectors? 

Any measure of nonclassical correlations assigns a number to a joint quantum state; the second 
question has to do with what this "amount" means. The demon-based measures have an operational 
interpretation as the work deficit suffered by local demons that are required to work only with the 
subsystems. Recently, there have been two closely related proposals for operational interpretations 
of quantum discord in terms of state- merging protocols [30l [3T]. Quantum discord has also been 
connected to the entanglement loss when mixed states are created from entangled states followed 
by entanglement distillation from those mixed states [32j. Such operational interpretations are 
essential to understand the meaning of a measure of nonclassical correlations. Whenever a measure 
of nonclassical correlations is proposed, the amount of nonclassicality must ultimately gain meaning 
through an operational interpretation. 
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Appendix A: The POVM inequality 



A quantum state written in its eigenbasis, 

P = ^K\ea){ea\ , 

a 

is subjected to a POVM with elements Ej. This gives outcome probabihties 

a 

where g^jo, = {ea\Ej\ea) is the probabihty for outcome j given state \ea){ea\' We define 

a 

which imphes that qj\a/Pj is a normahzed probabihty distribution on a. 
Now let f{x) = —xlogx and proceed through the following steps: 



j ^ a 



= - ^ qj\aK log Ac, - ^ Xaqjia log /ij 

a,j a,j 

= - X] Ac, log Aq, - logfij 

a j 

= S{p)-Y,Pjlog{trEj). 



(Al) 



(A2) 



(A3) 



(A4) 



(A5) 
(A6) 



The inequality (A4) uses that f{x) is a concave function and that qj\a/ Pj is a probability distri- 
bution over a. The step leading to Eq. (A5) uses that qj^f^ is a normalized distribution over j. 



Appendix B: Nonnegativity and ordering of the WPM measure and discord 

In this Appendix we show that the WPM measure is bounded below by quantum discord and 
that the quantum discord is nonnegative. We proceed by assuming that systems A and B are 
subjected to measurements described by POVMs with rank-one POVM elements Ea and F5, as 
in measurement strategy (b). For convenience, we use the fact that any set of POVM elements 
can be extended to rank-one orthogonal projection operators in a space of higher dimension, an 
extension called the Naimark extension [33]. Formally, we have an orthonormal basis \ea) in 
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the higher-dimensional space such that Ea = PA\ea){^a\PA, where Pa projects onto the original 
Hilbert space of system A, which is the space where the states of system A live; similarly, there is 
an orthonormal basis such that F5 = PB\fb){fb\PBj where Pb projects onto the original Hilbert 
space of system B. 

We can write the joint probability for outcomes a and b as 

Pab = t^{Ea^ FijPab) = tT(^PA\ea){ea\PA ^ PB\fb){fb\PBpAB^ = {eaJb\pAB\eaJb) , (Bl) 

where the last equality follows because pab lives in the original Hilbert space of A and 5, so we 
can discard the projectors into that space. Other results we need below include 

tTAjEgpAB) {ea\pAB\ea) . . 

PB\a = = , i^^j 

' Pa Pa 

where 

Pa = tT{EapA) = {ea\pA\ea) • (B3) 

We now extend the joint state pab to a space with two additional systems, C and D. We let C 
have dimension equal to the number of outcomes a, with an orthonormal basis \ga), we let D have 
dimension equal to the number of outcomes 6, with orthonormal basis \h})). The extended state, 

PABCD = \^ciJb){eaJb\pAB\ea'Jb'){^a',fb'\ ^ \9a){ga^ ^ \hb){hb^\ , (B4) 

a,a',b,b' 

can be regarded as a state where systems C and D record the measurement outcomes in their 
orthonormal bases. Notice that the entropy of the extended state is 

S\A,B,C,D) = S{A,B) . (B5) 

The proof follows from two applications of the strong-subadditivity property of von Neumann 
entropy [20j . Various marginals of the extended state and their von Neumann entropies enter into 
the proof: 

Pabd = Pa\ea){ea\ \fb){fb\pB\a\fb'){fb'\ ^ \hb){hb'\ , (B6) 

a,b,b' 

Pac = K){ea\pA\ea'){ea'\ ^ \9a){9a'\ , Pbd = Y \fb){fb\pB\fb'){fb'\ ^ \hb){hb'\ , (B7) 

a,a' b,b' 

Pab = X]Pa6|ea,/6)(ea,/6| , Pa = X]pa|ea)(ea| , Pb = X]P6|/6)(/6| , (B8) 

a,b a a 

PC = X]Pa|^a)(^a| , Pd = P6|/^6) (/^6| • (B9) 

a a 

These have von Neumann entropies 

S'{A,B,D) = H(A) + Y,PaS{B\a) , (BIO) 

a 

S'{A, C) = S{A) , S'{B, D) = S{B) , (Bll) 
S'{A, B) = H{A, B) , S'{A) = S'{C) = H{A) , S'{B) = S'{D) = H{B) . (B12) 
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The proof now comes in a rush. Recahing Eq. (2.37), we use the above results to write 



[S{A : B) - H{A : B)] - [H{E^yiB\A) - S{B\A)] 

= -S'{A, B, D) - S'(B) + S'{A, B) + S'(B, D) 

^ S'{A\B) - S'{A\B,D)>0 . (B13) 

The inequahty is precisely the expression of strong subadditivity for systems A, B, and D. It 
shows that 

m'^^™^ > = V{A ^ B) . (B14) 

Concentrating now on discord, we write 

H[E^}{B\A) - S{B\A) = -S\A, D) - S\A) + S\A, B, D) + S\A, C) 

= S'{B, D\A) - S'{B, D\A, C) > , (B15) 

where again the inequahty is strong subadditivity, this time for systems BD^ A, and C. This 
inequahty shows that discord is always nonnegative. 



The equahty conditions for strong subadditivity [2T] can be apphed to inequahties (B13) 



and (B15). From the latter inequality, we learn that j^^^^*^^^^ = 7)(^A B) is zero if and 



only if pAB is diagonal in a conditional product basis pointing from A to B. Datta [34], in the 
proof of his Theorem 2, has shown how to use the equality conditions for strong subadditivity 



to obtain this necessary and sufficient condition for zero discord. From (B13), we learn that 
^(WPM) _ ^(discord) _ j^^^j^ _^ •£ ^^^y is diagonal in a conditional product basis 

pointing from B to A. Combining these two results, we have that the WPM measure is zero if and 
only if PAB is diagonal in a product basis. 



Appendix C: Projective measurements vs. POVMs for WPM and discord 

In this Appendix, we elaborate an example given by WPM [llj, which exhibits a joint state 
that requires the use of rank-one POVMs, not just orthogonal-projection- valued measurements, 
to maximize the classical mutual information in evaluating the WPM measure. We extend these 
results to show that for the same states, rank-one POVMs are required for evaluating the discord. 

Consider a joint state 

ds dB 
PAB = ^PjPj ^ \ej){ej\ = ^PjPj ^ Pj , (CI) 

where the states \ej) make up an orthonormal basis for system B. The marginal states are given 

by 



j j 



(C2) 
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and this gives 

S{B) = H{pj) , (C3) 
S{A,B)^H{pj) + Y,PjS{pj), (C4) 

j 

S(A\B) = S{A, B) - S{B) = Y,PjS{pj) , (C5) 

j 

S{B\A) = S{A, B) - S{A) = H{pi) + Y.PjS{pj) - S{A) , (C6) 

j 

SiA : B) = SiB) - S{B\A) = SiA) - . (C7) 

j 

The quantum mutual information is the Holevo quantity for the ensemble of states pj with prob- 
abilities pj. This is not surprising because pab describes a situation where B sends a message to 
A: the message has the letters j, with probabilities pj; B keeps a record of the message in the 
orthogonal states \ej) and encodes the letters in the states pj. 

The state (CI) has zero discord when communication goes from B to A, i.e., V[B — ^ A) = 0, 
because pab is diagonal in a conditional product basis pointing from B to A. Generally, it has 
nonzero discord, T>[A B), when communication goes from A to B. The results of Appendix [B| 
show that the WPM measure equals V{A B) for such states. We return to discord below; for 
now, we focus on the WPM measure. Given any unconditioned, local measurements on A and 5, 
the joint probability for results a and h is 

dB 

Pah = "^PjPaljPblj , (C8) 
J = l 

where p^^j = tv{EaPj) and p^j^ = tv{Fi)Pj). We can think of Pah as the marginal of a joint distribu- 
tion for a, 6, and j\ 

Pabj = PjPa\jPb\j • (C9) 

That A and B are conditionally independent means that p^^j^^ = p^^j^, which implies that H{A\J) = 
H{A\B,J). Thus we have 

H{A : J) - H{A : B) = H{A\B) - H{A\J) = H{A\B) - H{A\B, J) > , (CIO) 

where the final inequality follows from classical strong subadditivity, which says that additional con- 
ditioning cannot increase the entropy. Measuring B in the eigenbasis \ej) gives H[A : B) = H[A : 
J), so we can conclude that the maximum mutual information is attained on this measurement. 
The WPM measure reduces to a form that only requires a maximization over the measurement 
on A\ 

^(WPM) ^ . _ . ^) _ . ^) _ maxiJ(A : J) . (Cll) 

(b) {Ea} 
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We now proceed to specialize the joint state pab in four ways. First, we assume that system A 
is a qubit and that the ensemble states, 

p, = ]^{lA + (T-n,) , (C12) 

are pure; i.e., the vectors rij are unit vectors. The measurement on A is described by rank-one 
POVM elements 

Ea = qa[lA + (T'ma) , (C13) 

where the vectors rua are unit vectors. The completeness relation for the POVM implies that the 
quantities qa make up a normalized probability distribution and that 

Y,qama = (). (C14) 

a 

The probability for result a, given state pj, is 

Pa\j = tr(£;aPj) = qa{l + Uj • nia) , (C15) 
and the joint probability for results a and j is 

Paj = tr(pABK ^ Pj) = PjtT{Eapj) = Pjga(l + rij • nia) • (C16) 
The second specialization is to assume that the states pj are distributed so that 

^p.nj = {) ^ pA = ^pjpj = -I^ . (C17) 

With this assumption we have that the probability for result a is pa = qa and thus that S{B\A) = 
H{pj) — S{A) = H{pj) — 1 and S{A : B) = S{A) = 1. The classical mutual information is 

H{A : J) = H{A) - H{A\J) = log(p„|,/pa) = Yl laFima) , (C18) 

a 

where we define the function 

dB 

F{m) = + rij ' m) log(l + rij • m) . (C19) 

The WPM measure is now given by M.^^^^^ = 1 — H{A : J), where 

H{A : J) = maxiJ(A : J) = max qaF{ma) . (C20) 

Before going on to the third specialization, let's consider the quantum discord when conditioning 



on A. We again make the first two specializations: a joint state of the form (CI), with A being 



a qubit, and the states pj being the pure states (C12), distributed according to Eq. (C17). We 
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measure the POVM (C13) on A. The probabihty for result a is pa = Qa, and the state of B. 



conditioned on result a, is 

PB\a = = 2^ ^^Pj = Z^Pji^ + • '^a)Pj , (C21) 

Pa Qa .^^ 

which has quantum entropy 

S{B\a) = - J2pj(^ + ' logfe(l + ' ^a)) . (C22) 
The conditional classical entropy that goes into the definition ( |2.38 ) of discord becomes 



H{B\A) = mmy2paS{B\a) = H{p,) - m^^ y^qaF{ma) = H{p,) - H{A : J) , (C23) 

SO the discord is given by 

V{A ^B)= ^^^^) = H{B\A) - S{B\A) = 1 - H{A : J) = X^™) . (C24) 

As expected, the discord is the same as the WPM measure for this set of joint states. 

Our third specialization is to assume that the ensemble probabilities are all equal, i.e., pj = 
1/dB, and the fourth, needed to work out examples, is that the vectors rij are symmetrically 
distributed, pointing to the vertices of an equilateral triangle or of a regular polyhedron. Within 
this configuration, we first maximize the function F{m). The high degree of symmetry guarantees 
that the extrema of F{m) occur along the symmetry axes of the triangle or polyhedron. Having 
determined the maximum value, Fmax, one knows that this maximum is an upper bound oiv H[A\ 
J). The high degree of symmetry further guarantees that one can make up a POVM out of the 
directions that give the maximum value, with qa chosen to be the same for all these directions; 
since this choice achieves the bound, one has H[A : J) = Fmax- Moreover, if no two of the 
directions rria are oppositely directed, the upper bound cannot be achieved with a projection- 
valued measurement. One ends up knowing, first, the common value of the WPM measure and 
quantum discord and, second, that the optimal measurement cannot be described by orthogonal 
projection operators. 

For a triangle (tetrahedron) of states, the maximum value of F is attained on the vectors that are 
directed opposite to the vectors that define the state. The maximum value is Fmax = log | for the 
triangle and Fmax = log | for the tetrahedron. The optimal measurement is the trine (tetrahedron) 
measurement that uses the triangle (tetrahedron) dual to the state triangle (tetrahedron). Both 
the WPM measure and discord are equal to 1 — log | = log | = 0.415 for the triangle and to 
1 — log I = log I = 0.585 for the tetrahedron. 

We stress that the examples in this Appendix require that subsystem B have three or more 
Hilbert-space dimensions. These examples thus do not exclude the possibility that projection- 
valued- measures suffice for WPM and discord for a pair of qubits. 

It is worth noting that if one asks about demon discord, the quantity one needs to evaluate. 



instead of being the classical conditional entropy (C23), is the classical joint entropy (2.49). Making 
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the first two specializations gives a demon discord 

) = min (H{qa) - ^ qaF{ma)] . (C25) 

{qa,ma}\ ^ J 

The additional contribution from H{qa) prejudices this minimum toward using a smaller number of 
outcomes for the measurement on A and thus toward orthogonal-projection- valued measurements. 
Indeed, for the triangle of states, with a trine measurement made in the dual triangle, the argument 
of the minimum is equal to 1. For an orthogonal-project ion- valued measurement, with mi pointing 
toward one vertex of the state triangle and m2 in the opposite direction, the argument is equal 
to I — ^log3 = 0.541, which thus becomes the demon discord. For this joint state, the optimal 
measurement for demon discord is orthogonal-project ion- valued. 



Appendix D: Demon-based measures and rank-one POVMs 



In this Appendix, we modify the formula for the net classical work to show that the local demons 
cannot do worse in terms of net classical work by restricting themselves to rank-one POVMs. 



We use the general measurement formalism of Sec. II B, which allows us to do strategies (b) 
and (c) simultaneously. The state of system A after a measurement yields result a is 

PA\a = , (Dl) 

Pa 

and the state of system B after measurements that yield outcomes a and b is given by 

Bb\c{a){pB\a) . . 

PB\ab = • [^^) 

Pb\a 

As the systems are transformed to the maximally mixed state, the local demons can extract work 

W+ = logidAds) - ^PaS{pA\a) " Pa65'(pB|a6) • (D3) 



a a,b 



The cost of erasing the measurement record, given communication between the demons, is = 
H{A^ B), giving a net classical work 

Wc^W+-W-^ logidAdB) - H{A, B) - Y.PaS{pA\a) " Y.P-i>^^PB\ab) ■ (D4) 

a a,b 

The measure of nonclassical correlations requires maximizing Wc over all possible measurements. 

After finishing the first round of measurements, instead of extracting work, the local demons 
can make further measurements in the eigenbases of and PB\ab- The overall measurement is 
now described by rank-one POVMs. After these measurements, the subsystems are left in pure 
states, so the local demons can extract work = \og{dAdB) as the systems are transformed to 
the maximally mixed state, but they have a more detailed measurement record, so their erasure 
cost is greater. If we let A^^j^^ be the eigenvalues of pA\a and A^|^5 be the eigenvalues of PB\ab^ then 
after these measurements, the new erasure cost is 

W- = HiA, B) + Y, PaH(K\a) + E PabHiXf^lab) ■ (D5) 

a a,b 
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Since the classical entropies of the eigenvalues are the same as the quantum entropies, the net 
classical work is the same as that given in Eq. ( D4 ) . The reduction in classical work from not using 
rank-one POVMs has been transferred to an increased erasure cost when making measurements 
described by rank-one POVMs. We conclude that the demons cannot do worse by restricting 



themselves to rank-one POVMs, thus justifying our assumption of rank-one POVMs in Sec. II C 
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